Overview

Dataset info

Number of variables: 28
Number of observations: 5043
Missing cells: 2191 (1.6%)
Duplicate rows: 45 (0.9%)
Total size in memory: 5.1 MiB
Average record size in memory: 1.0 KiB
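
The percentages in this block can be re-derived from the counts. The sketch below is a sanity check, not part of pandas-profiling: it sums the per-variable Missing counts listed in the Variables section of this report and divides by the number of cells (variables times observations). The name `missing_per_variable` is made up for this illustration.

```python
# Per-variable "Missing" counts copied from the Variables section of this report.
missing_per_variable = {
    "actor_1_fb_likes": 7, "actor_1_name": 7,
    "actor_2_fb_likes": 13, "actor_2_name": 13,
    "actor_3_fb_likes": 23, "actor_3_name": 23,
    "aspect_ratio": 329, "budget": 403,
    "cast_total_fb_likes": 0, "color": 19,
    "content_rating": 303, "country": 5,
    "director_fb_likes": 104, "director_name": 104,
    "duration": 15, "facenumber_in_poster": 13,
    "genres": 0, "gross": 466,
    "imdb_score": 0, "language": 12,
    "movie_fb_likes": 0, "movie_imdb_link": 0,
    "movie_title": 0, "num_critic_for_reviews": 50,
    "num_user_for_reviews": 21, "num_voted_users": 0,
    "plot_keywords": 153, "title_year": 108,
}

n_variables = 28
n_observations = 5043
duplicate_rows = 45

total_missing = sum(missing_per_variable.values())
total_cells = n_variables * n_observations

missing_pct = 100 * total_missing / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations

print(total_missing, f"{missing_pct:.1f}%", f"{duplicate_pct:.1f}%")
# prints: 2191 1.6% 0.9%
```

The per-variable counts add up to the reported 2191 missing cells, which suggests the overview figure is a plain sum over columns.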

Variables types

NUM: 16
CAT: 11
URL: 1

Reproduction info

Date of analysis: 2020-03-10 18:59:09.625040
Version: pandas-profiling v2.4.0
Command line: pandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download Configuration: config.yaml

Warnings

Dataset has 45 (0.9%) duplicate rows Warning
actor_1_name has a high cardinality: 2098 distinct values Warning
actor_2_fb_likes has 55 (1.1%) zeros Zeros
actor_2_name has a high cardinality: 3033 distinct values Warning
actor_3_fb_likes has 89 (1.8%) zeros Zeros
actor_3_name has a high cardinality: 3522 distinct values Warning
aspect_ratio has 329 (6.5%) missing values Missing
budget has 403 (8.0%) missing values Missing
budget is highly skewed (γ1 = 48.59279525) Skewed
content_rating has 303 (6.0%) missing values Missing
country has a high cardinality: 66 distinct values Warning
director_fb_likes has 104 (2.1%) missing values Missing
director_fb_likes has 907 (18.0%) zeros Zeros
director_name has 104 (2.1%) missing values Missing
director_name has a high cardinality: 2399 distinct values Warning
facenumber_in_poster has 2152 (42.7%) zeros Zeros
genres has a high cardinality: 914 distinct values Warning
gross has 466 (9.2%) missing values Missing
movie_fb_likes has 2181 (43.2%) zeros Zeros
movie_title has a high cardinality: 4917 distinct values Warning
plot_keywords has 153 (3.0%) missing values Missing
plot_keywords has a high cardinality: 4761 distinct values Warning
title_year has 108 (2.1%) missing values Missing
cast_total_fb_likes is highly correlated with actor_1_fb_likes High Correlation
actor_1_fb_likes is highly correlated with cast_total_fb_likes High Correlation
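
The report flags this pair but shows neither the coefficient nor the correlation measure that triggered the warning. As a rough illustration only, the sketch below computes a Pearson coefficient for the two columns from the ten "First rows" in the Sample section of this report. Ten rows are far too few to reproduce the report's result, and both the `pearson` helper and the 0.9 `THRESHOLD` are assumptions made for this example.

```python
from math import sqrt

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)

# actor_1_fb_likes and cast_total_fb_likes for rows 0-9 of the Sample section.
actor_1_fb_likes = [1000, 40000, 11000, 27000, 131, 640, 24000, 799, 26000, 25000]
cast_total_fb_likes = [4834, 48350, 11700, 106759, 143, 1873, 46055, 2036, 92000, 58753]

THRESHOLD = 0.9  # assumed cut-off, not taken from the report

r = pearson(actor_1_fb_likes, cast_total_fb_likes)
print(f"r = {r:.3f}, above threshold: {abs(r) > THRESHOLD}")
```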

Variables

actor_1_fb_likes
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count: 879
Unique (%): 17.4%
Missing: 7
Missing (%): 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 6560.047061
Minimum: 0
Maximum: 640000
Zeros: 26
Zeros (%): 0.5%
Memory size: 39.5 KiB
Mini histogram

Quantile statistics

Minimum: 0
5-th percentile: 95.5
Q1: 614
Median: 988
Q3: 11000
95-th percentile: 24000
Maximum: 640000
Range: 640000
Interquartile range (IQR): 10386

Descriptive statistics

Standard deviation: 15020.75912
Coefficient of variation (CV): 2.289733439
Kurtosis: 683.5473559
Mean: 6560.047061
Median Absolute Deviation (MAD): 7727.675203
Skewness: 19.12177638
Sum: 33036397
Variance: 225623204.5
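
Several of these statistics are derived from others in the same block. The sketch below recomputes them from the actor_1_fb_likes values in this report, assuming the usual definitions (CV = standard deviation / mean, variance = standard deviation squared, IQR = Q3 - Q1, range = maximum - minimum). The variable names are illustrative only.

```python
# Values copied from the actor_1_fb_likes section of this report.
mean = 6560.047061
std = 15020.75912
q1, q3 = 614, 11000
minimum, maximum = 0, 640000

cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance from the standard deviation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range

print(f"CV={cv:.6f} variance={variance:.1f} IQR={iqr} range={value_range}")
```

The recomputed values agree with the reported CV, variance, IQR and range up to the rounding of the inputs.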
Histogram
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1000 449 8.9%
 
11000 211 4.2%
 
2000 197 3.9%
 
3000 155 3.1%
 
12000 135 2.7%
 
13000 127 2.5%
 
14000 123 2.4%
 
10000 112 2.2%
 
18000 109 2.2%
 
22000 82 1.6%
 
Other values (868) 3336 66.2%
 
ValueCountFrequency (%) 
0 26 0.5%
 
2 8 0.2%
 
3 4 0.1%
 
4 2 < 0.1%
 
5 7 0.1%
 
ValueCountFrequency (%) 
640000 1 < 0.1%
 
260000 3 0.1%
 
164000 2 < 0.1%
 
137000 2 < 0.1%
 
87000 8 0.2%
 

actor_1_name
Categorical

HIGH CARDINALITY
Distinct count: 2098
Unique (%): 41.6%
Missing: 7
Missing (%): 0.1%
Memory size: 39.5 KiB
Robert De Niro
 
49
Johnny Depp
 
41
Nicolas Cage
 
33
J.K. Simmons
 
31
Matt Damon
 
30
Other values (2092)
4852
ValueCountFrequency (%) 
Robert De Niro 49 1.0%
 
Johnny Depp 41 0.8%
 
Nicolas Cage 33 0.7%
 
J.K. Simmons 31 0.6%
 
Matt Damon 30 0.6%
 
Bruce Willis 30 0.6%
 
Denzel Washington 30 0.6%
 
Liam Neeson 29 0.6%
 
Harrison Ford 27 0.5%
 
Steve Buscemi 27 0.5%
 
Other values (2087) 4709 93.4%
 

Composition

Contains chars: True
Contains digits: True
Contains whitespace: True
Contains non-words: True

Length

Max length: 27
Mean length: 13.1782669
Min length: 3
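
The report does not say how the Composition flags are computed. One reading consistent with the values shown elsewhere in this report (genres contains non-word characters but no whitespace; language contains neither) is a per-column regular-expression test. The sketch below follows that reading; the `CHECKS` patterns and the `composition` helper are assumptions for illustration, not pandas-profiling's code.

```python
import re

# Assumed patterns, one per Composition flag.
CHECKS = {
    "chars": r"[a-zA-Z]",
    "digits": r"[0-9]",
    "whitespace": r"\s",
    "non-words": r"\W",
}

def composition(values):
    """Return, per flag, whether any value in the column matches the pattern."""
    return {
        name: any(re.search(pattern, value) for value in values)
        for name, pattern in CHECKS.items()
    }

# A few values taken from the language and genres frequency tables.
print(composition(["English", "French", "Spanish"]))
print(composition(["Drama", "Comedy|Drama"]))
```

Under these patterns the pipe in "Comedy|Drama" counts as a non-word character, and any space would set both the whitespace and the non-words flags.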
Scatter

actor_2_fb_likes
Real number (ℝ≥0)

ZEROS
Distinct count: 918
Unique (%): 18.2%
Missing: 13
Missing (%): 0.3%
Infinite: 0
Infinite (%): 0.0%
Mean: 1651.754473
Minimum: 0
Maximum: 137000
Zeros: 55
Zeros (%): 1.1%
Memory size: 39.5 KiB
Mini histogram

Quantile statistics

Minimum: 0
5-th percentile: 26
Q1: 281
Median: 595
Q3: 918
95-th percentile: 11000
Maximum: 137000
Range: 137000
Interquartile range (IQR): 637

Descriptive statistics

Standard deviation: 4042.438863
Coefficient of variation (CV): 2.447360627
Kurtosis: 256.7951889
Mean: 1651.754473
Median Absolute Deviation (MAD): 1979.395883
Skewness: 9.884733179
Sum: 8308325
Variance: 16341311.96
Histogram
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1000 309 6.1%
 
11000 111 2.2%
 
2000 100 2.0%
 
3000 76 1.5%
 
0 55 1.1%
 
10000 47 0.9%
 
14000 41 0.8%
 
13000 40 0.8%
 
826 37 0.7%
 
4000 34 0.7%
 
Other values (907) 4180 82.9%
 
ValueCountFrequency (%) 
0 55 1.1%
 
2 14 0.3%
 
3 14 0.3%
 
4 12 0.2%
 
5 10 0.2%
 
ValueCountFrequency (%) 
137000 1 < 0.1%
 
29000 1 < 0.1%
 
27000 2 < 0.1%
 
25000 3 0.1%
 
23000 6 0.1%
 

actor_2_name
Categorical

HIGH CARDINALITY
Distinct count: 3033
Unique (%): 60.1%
Missing: 13
Missing (%): 0.3%
Memory size: 39.5 KiB
Morgan Freeman
 
20
Charlize Theron
 
15
Brad Pitt
 
14
James Franco
 
11
Meryl Streep
 
11
Other values (3027)
4959
ValueCountFrequency (%) 
Morgan Freeman 20 0.4%
 
Charlize Theron 15 0.3%
 
Brad Pitt 14 0.3%
 
James Franco 11 0.2%
 
Meryl Streep 11 0.2%
 
Jason Flemyng 10 0.2%
 
Adam Sandler 10 0.2%
 
Steve Buscemi 9 0.2%
 
Angelina Jolie Pitt 9 0.2%
 
Bruce Willis 9 0.2%
 
Other values (3022) 4912 97.4%
 
(Missing) 13 0.3%
 

Composition

Contains chars: True
Contains digits: True
Contains whitespace: True
Contains non-words: True

Length

Max length: 28
Mean length: 13.0483839
Min length: 3
Scatter

actor_3_fb_likes
Real number (ℝ≥0)

ZEROS
Distinct count: 907
Unique (%): 18.0%
Missing: 23
Missing (%): 0.5%
Infinite: 0
Infinite (%): 0.0%
Mean: 645.009761
Minimum: 0
Maximum: 23000
Zeros: 89
Zeros (%): 1.8%
Memory size: 39.5 KiB
Mini histogram

Quantile statistics

Minimum: 0
5-th percentile: 10
Q1: 133
Median: 371.5
Q3: 636
95-th percentile: 1000
Maximum: 23000
Range: 23000
Interquartile range (IQR): 503

Descriptive statistics

Standard deviation: 1665.041728
Coefficient of variation (CV): 2.581420979
Kurtosis: 60.56388811
Mean: 645.009761
Median Absolute Deviation (MAD): 569.3467201
Skewness: 7.279020793
Sum: 3237949
Variance: 2772363.957
Histogram
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1000 126 2.5%
 
0 89 1.8%
 
11000 29 0.6%
 
3 28 0.6%
 
2000 27 0.5%
 
3000 26 0.5%
 
826 22 0.4%
 
2 21 0.4%
 
4 21 0.4%
 
7 21 0.4%
 
Other values (896) 4610 91.4%
 
(Missing) 23 0.5%
 
ValueCountFrequency (%) 
0 89 1.8%
 
2 21 0.4%
 
3 28 0.6%
 
4 21 0.4%
 
5 18 0.4%
 
ValueCountFrequency (%) 
23000 2 < 0.1%
 
20000 1 < 0.1%
 
19000 5 0.1%
 
17000 1 < 0.1%
 
16000 3 0.1%
 

actor_3_name
Categorical

HIGH CARDINALITY
Distinct count: 3522
Unique (%): 69.8%
Missing: 23
Missing (%): 0.5%
Memory size: 39.5 KiB
Steve Coogan
 
8
Ben Mendelsohn
 
8
John Heard
 
8
Sam Shepard
 
7
Jon Gries
 
7
Other values (3516)
4982
ValueCountFrequency (%) 
Steve Coogan 8 0.2%
 
Ben Mendelsohn 8 0.2%
 
John Heard 8 0.2%
 
Sam Shepard 7 0.1%
 
Jon Gries 7 0.1%
 
Robert Duvall 7 0.1%
 
Lois Maxwell 7 0.1%
 
Stephen Root 7 0.1%
 
Anne Hathaway 7 0.1%
 
Kirsten Dunst 7 0.1%
 
Other values (3511) 4947 98.1%
 
(Missing) 23 0.5%
 

Composition

Contains chars: True
Contains digits: True
Contains whitespace: True
Contains non-words: True

Length

Max length: 29
Mean length: 13.03628792
Min length: 3
Scatter

aspect_ratio
Real number (ℝ≥0)

MISSING
Distinct count: 23
Unique (%): 0.5%
Missing: 329
Missing (%): 6.5%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.220403055
Minimum: 1.18
Maximum: 16
Zeros: 0
Zeros (%): 0.0%
Memory size: 39.5 KiB
Mini histogram

Quantile statistics

Minimum: 1.18
5-th percentile: 1.66
Q1: 1.85
Median: 2.35
Q3: 2.35
95-th percentile: 2.35
Maximum: 16
Range: 14.82
Interquartile range (IQR): 0.5

Descriptive statistics

Standard deviation: 1.385112535
Coefficient of variation (CV): 0.6238113087
Kurtosis: 90.65322055
Mean: 2.220403055
Median Absolute Deviation (MAD): 0.4004107589
Skewness: 9.390056312
Sum: 10466.98
Variance: 1.918536735
Histogram
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
2.35 2360 46.8%
 
1.85 1906 37.8%
 
1.78 110 2.2%
 
1.37 100 2.0%
 
1.33 68 1.3%
 
1.66 64 1.3%
 
16 45 0.9%
 
2.2 15 0.3%
 
2.39 15 0.3%
 
4 7 0.1%
 
Other values (12) 24 0.5%
 
(Missing) 329 6.5%
 
ValueCountFrequency (%) 
1.18 1 < 0.1%
 
1.2 1 < 0.1%
 
1.33 68 1.3%
 
1.37 100 2.0%
 
1.44 1 < 0.1%
 
ValueCountFrequency (%) 
16 45 0.9%
 
4 7 0.1%
 
2.76 3 0.1%
 
2.55 2 < 0.1%
 
2.4 3 0.1%
 

budget
Real number (ℝ≥0)

MISSING
SKEWED
Distinct count: 443
Unique (%): 8.8%
Missing: 403
Missing (%): 8.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 39386466.73
Minimum: 218
Maximum: 1.22155e+10
Zeros: 0
Zeros (%): 0.0%
Memory size: 39.5 KiB
Mini histogram

Quantile statistics

Minimum: 218
5-th percentile: 500000
Q1: 6000000
Median: 20000000
Q3: 44000000
95-th percentile: 130000000
Maximum: 1.22155e+10
Range: 1.221549978e+10
Interquartile range (IQR): 38000000

Descriptive statistics

Standard deviation: 204181882.1
Coefficient of variation (CV): 5.184061916
Kurtosis: 2774.956653
Mean: 39386466.73
Median Absolute Deviation (MAD): 37365363.95
Skewness: 48.59279525
Sum: 1.827532056e+11
Variance: 4.1690241e+16
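
The skewness reported here is what triggers the "highly skewed" warning for budget. The report does not state which estimator is used; the sketch below assumes the bias-adjusted Fisher-Pearson coefficient. The `sample_skewness` helper and the toy list are illustrative: the list mixes five of the most common budget values from this report with the reported maximum, so it shows the effect of one extreme value and does not reproduce the reported figure.

```python
def sample_skewness(values):
    """Bias-adjusted Fisher-Pearson skewness (assumed estimator)."""
    n = len(values)
    if n < 3:
        raise ValueError("need at least three values")
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n  # second central moment
    m3 = sum((v - mean) ** 3 for v in values) / n  # third central moment
    g1 = m3 / m2 ** 1.5                            # unadjusted skewness
    return g1 * (n * (n - 1)) ** 0.5 / (n - 2)     # small-sample adjustment

symmetric = [1, 2, 3, 4, 5]
# Five common budget values from this report plus the reported maximum.
budgets = [20000000, 30000000, 15000000, 25000000, 10000000, 12215500000]

print(sample_skewness(symmetric))  # symmetric data has zero skewness
print(sample_skewness(budgets))    # one huge value gives a large positive skew
```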
Histogram
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
20000000 179 3.5%
 
30000000 147 2.9%
 
15000000 146 2.9%
 
25000000 143 2.8%
 
10000000 143 2.8%
 
40000000 134 2.7%
 
35000000 120 2.4%
 
5000000 111 2.2%
 
50000000 106 2.1%
 
12000000 95 1.9%
 
Other values (432) 3316 65.8%
 
(Missing) 403 8.0%
 
ValueCountFrequency (%) 
218 1 < 0.1%
 
1100 1 < 0.1%
 
1400 1 < 0.1%
 
3250 1 < 0.1%
 
4500 1 < 0.1%
 
ValueCountFrequency (%) 
1.22155e+10 1 < 0.1%
 
4200000000 1 < 0.1%
 
2500000000 1 < 0.1%
 
2400000000 1 < 0.1%
 
2127519898 1 < 0.1%
 

cast_total_fb_likes
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count: 3978
Unique (%): 78.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 9699.063851
Minimum: 0
Maximum: 656730
Zeros: 33
Zeros (%): 0.7%
Memory size: 39.5 KiB
Mini histogram

Quantile statistics

Minimum: 0
5-th percentile: 179
Q1: 1411
Median: 3090
Q3: 13756.5
95-th percentile: 36927.7
Maximum: 656730
Range: 656730
Interquartile range (IQR): 12345.5

Descriptive statistics

Standard deviation: 18163.79912
Coefficient of variation (CV): 1.872737349
Kurtosis: 361.2551153
Mean: 9699.063851
Median Absolute Deviation (MAD): 10152.51874
Skewness: 12.83192773
Sum: 48912379
Variance: 329923598.6
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[0.00000e+00 1.00000e+00 8.15000e+01 2.76450e+03 3.31150e+03 ... 5.40790e+04 6.46985e+04 9.22280e+04 1.55193e+05 6.56730e+05], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 33 0.7%
 
5 7 0.1%
 
2020 6 0.1%
 
2 6 0.1%
 
1044 5 0.1%
 
673 5 0.1%
 
29 5 0.1%
 
2321 4 0.1%
 
1554 4 0.1%
 
646 4 0.1%
 
Other values (3968) 4964 98.4%
 
ValueCountFrequency (%) 
0 33 0.7%
 
2 6 0.1%
 
3 1 < 0.1%
 
4 2 < 0.1%
 
5 7 0.1%
 
ValueCountFrequency (%) 
656730 1 < 0.1%
 
303717 1 < 0.1%
 
283939 1 < 0.1%
 
263584 1 < 0.1%
 
261818 1 < 0.1%
 

color
Categorical

Distinct count: 3
Unique (%): 0.1%
Missing: 19
Missing (%): 0.4%
Memory size: 39.5 KiB
Color
4815
Black and White
 
209
ValueCountFrequency (%) 
Color 4815 95.5%
 
Black and White 209 4.1%
 
(Missing) 19 0.4%
 

Composition

Contains chars: True
Contains digits: False
Contains whitespace: True
Contains non-words: True

Length

Max length: 16
Mean length: 5.44834424
Min length: 3
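
The length figures for color are easiest to explain if missing values are measured as the three-character string "nan", and if "Black and White" is stored with one extra character (for example a leading space), which would account for a maximum length of 16. Both points are assumptions, not something the report states. The sketch below checks that this reading reproduces the reported mean length.

```python
# Counts copied from the color frequency table; string lengths are assumptions.
groups = [
    (4815, len("Color")),             # 5 characters
    (209, len(" Black and White")),   # 16 characters, assuming a leading space
    (19, len("nan")),                 # missing values assumed measured as "nan"
]

total_rows = sum(count for count, _ in groups)
mean_length = sum(count * length for count, length in groups) / total_rows

print(total_rows, round(mean_length, 6))
# prints: 5043 5.448344
```

If this reading is right, a minimum length of 3 in other categorical columns with missing values may also come from "nan" rather than from a real three-character entry.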
Scatter

content_rating
Categorical

MISSING
Distinct count: 19
Unique (%): 0.4%
Missing: 303
Missing (%): 6.0%
Memory size: 39.5 KiB
R
2118
PG-13
1461
PG
701
Not Rated
 
116
G
 
112
Other values (13)
 
232
ValueCountFrequency (%) 
R 2118 42.0%
 
PG-13 1461 29.0%
 
PG 701 13.9%
 
Not Rated 116 2.3%
 
G 112 2.2%
 
Unrated 62 1.2%
 
Approved 55 1.1%
 
TV-14 30 0.6%
 
TV-MA 20 0.4%
 
TV-PG 13 0.3%
 
Other values (8) 52 1.0%
 
(Missing) 303 6.0%
 

Composition

Contains chars: True
Contains digits: True
Contains whitespace: True
Contains non-words: True

Length

Max length: 9
Mean length: 2.825104105
Min length: 1
Scatter

country
Categorical

HIGH CARDINALITY
Distinct count: 66
Unique (%): 1.3%
Missing: 5
Missing (%): 0.1%
Memory size: 39.5 KiB
USA
3807
UK
 
448
France
 
154
Canada
 
126
Germany
 
97
Other values (60)
 
406
ValueCountFrequency (%) 
USA 3807 75.5%
 
UK 448 8.9%
 
France 154 3.1%
 
Canada 126 2.5%
 
Germany 97 1.9%
 
Australia 55 1.1%
 
India 34 0.7%
 
Spain 33 0.7%
 
China 30 0.6%
 
Japan 23 0.5%
 
Other values (55) 231 4.6%
 

Composition

Contains chars: True
Contains digits: False
Contains whitespace: True
Contains non-words: True

Length

Max length: 20
Mean length: 3.488796351
Min length: 2
Scatter

director_fb_likes
Real number (ℝ≥0)

MISSING
ZEROS
Distinct count: 436
Unique (%): 8.6%
Missing: 104
Missing (%): 2.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 686.5092124
Minimum: 0
Maximum: 23000
Zeros: 907
Zeros (%): 18.0%
Memory size: 39.5 KiB
Mini histogram

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 7
Median: 49
Q3: 194.5
95-th percentile: 973
Maximum: 23000
Range: 23000
Interquartile range (IQR): 187.5

Descriptive statistics

Standard deviation: 2813.328607
Coefficient of variation (CV): 4.098020181
Kurtosis: 27.25628935
Mean: 686.5092124
Median Absolute Deviation (MAD): 1069.818414
Skewness: 5.22970117
Sum: 3390669
Variance: 7914817.85
Histogram
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0 907 18.0%
 
3 70 1.4%
 
6 66 1.3%
 
7 64 1.3%
 
2 63 1.2%
 
4 60 1.2%
 
11 59 1.2%
 
10 53 1.1%
 
8 52 1.0%
 
5 52 1.0%
 
Other values (425) 3493 69.3%
 
(Missing) 104 2.1%
 
ValueCountFrequency (%) 
0 907 18.0%
 
2 63 1.2%
 
3 70 1.4%
 
4 60 1.2%
 
5 52 1.0%
 
ValueCountFrequency (%) 
23000 1 < 0.1%
 
22000 8 0.2%
 
21000 10 0.2%
 
20000 1 < 0.1%
 
18000 4 0.1%
 

director_name
Categorical

MISSING
HIGH CARDINALITY
Distinct count: 2399
Unique (%): 47.6%
Missing: 104
Missing (%): 2.1%
Memory size: 39.5 KiB
Steven Spielberg
 
26
Woody Allen
 
22
Martin Scorsese
 
20
Clint Eastwood
 
20
Ridley Scott
 
17
Other values (2393)
4834
ValueCountFrequency (%) 
Steven Spielberg 26 0.5%
 
Woody Allen 22 0.4%
 
Martin Scorsese 20 0.4%
 
Clint Eastwood 20 0.4%
 
Ridley Scott 17 0.3%
 
Tim Burton 16 0.3%
 
Steven Soderbergh 16 0.3%
 
Spike Lee 16 0.3%
 
Renny Harlin 15 0.3%
 
Oliver Stone 14 0.3%
 
Other values (2388) 4757 94.3%
 
(Missing) 104 2.1%
 

Composition

Contains chars: True
Contains digits: False
Contains whitespace: True
Contains non-words: True

Length

Max length: 32
Mean length: 12.87685901
Min length: 3
Scatter

duration
Real number (ℝ≥0)

Distinct count: 192
Unique (%): 3.8%
Missing: 15
Missing (%): 0.3%
Infinite: 0
Infinite (%): 0.0%
Mean: 107.201074
Minimum: 7
Maximum: 511
Zeros: 0
Zeros (%): 0.0%
Memory size: 39.5 KiB
Mini histogram

Quantile statistics

Minimum: 7
5-th percentile: 81
Q1: 93
Median: 103
Q3: 118
95-th percentile: 146
Maximum: 511
Range: 504
Interquartile range (IQR): 25

Descriptive statistics

Standard deviation: 25.19744081
Coefficient of variation (CV): 0.235048399
Kurtosis: 22.56579716
Mean: 107.201074
Median Absolute Deviation (MAD): 16.81590041
Skewness: 2.339134041
Sum: 539007
Variance: 634.9110233
Histogram
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
90 161 3.2%
 
100 141 2.8%
 
101 139 2.8%
 
98 135 2.7%
 
97 131 2.6%
 
93 129 2.6%
 
95 124 2.5%
 
99 124 2.5%
 
94 124 2.5%
 
96 113 2.2%
 
Other values (181) 3707 73.5%
 
ValueCountFrequency (%) 
7 2 < 0.1%
 
11 1 < 0.1%
 
14 1 < 0.1%
 
20 1 < 0.1%
 
22 7 0.1%
 
ValueCountFrequency (%) 
511 1 < 0.1%
 
334 1 < 0.1%
 
330 1 < 0.1%
 
325 1 < 0.1%
 
300 1 < 0.1%
 

facenumber_in_poster
Real number (ℝ≥0)

ZEROS
Distinct count: 20
Unique (%): 0.4%
Missing: 13
Missing (%): 0.3%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.371172962
Minimum: 0
Maximum: 43
Zeros: 2152
Zeros (%): 42.7%
Memory size: 39.5 KiB
Mini histogram

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 1
Q3: 2
95-th percentile: 5
Maximum: 43
Range: 43
Interquartile range (IQR): 2

Descriptive statistics

Standard deviation: 2.01357592
Coefficient of variation (CV): 1.468506144
Kurtosis: 52.03373533
Mean: 1.371172962
Median Absolute Deviation (MAD): 1.357893277
Skewness: 4.384765939
Sum: 6897
Variance: 4.054487986
Histogram
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0 2152 42.7%
 
1 1251 24.8%
 
2 716 14.2%
 
3 380 7.5%
 
4 207 4.1%
 
5 114 2.3%
 
6 76 1.5%
 
7 48 1.0%
 
8 37 0.7%
 
9 18 0.4%
 
Other values (9) 31 0.6%
 
(Missing) 13 0.3%
 
ValueCountFrequency (%) 
0 2152 42.7%
 
1 1251 24.8%
 
2 716 14.2%
 
3 380 7.5%
 
4 207 4.1%
 
ValueCountFrequency (%) 
43 1 < 0.1%
 
31 1 < 0.1%
 
19 1 < 0.1%
 
15 6 0.1%
 
14 1 < 0.1%
 

genres
Categorical

HIGH CARDINALITY
Distinct count: 914
Unique (%): 18.1%
Missing: 0
Missing (%): 0.0%
Memory size: 39.5 KiB
Drama
 
236
Comedy
 
209
Comedy|Drama
 
191
Comedy|Drama|Romance
 
187
Comedy|Romance
 
158
Other values (909)
4062
ValueCountFrequency (%) 
Drama 236 4.7%
 
Comedy 209 4.1%
 
Comedy|Drama 191 3.8%
 
Comedy|Drama|Romance 187 3.7%
 
Comedy|Romance 158 3.1%
 
Drama|Romance 152 3.0%
 
Crime|Drama|Thriller 101 2.0%
 
Horror 71 1.4%
 
Action|Crime|Drama|Thriller 68 1.3%
 
Action|Crime|Thriller 65 1.3%
 
Other values (904) 3605 71.5%
 

Composition

Contains chars: True
Contains digits: False
Contains whitespace: False
Contains non-words: True

Length

Max length: 64
Mean length: 20.31310728
Min length: 5
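
The high cardinality of genres comes from each value being a pipe-separated combination rather than a single genre. As an illustration, the sketch below splits the ten most frequent combinations listed in this section on "|" and tallies the individual genres. It covers only those ten rows of the frequency table, not the full column, and the names `top_combinations` and `genre_tally` are made up for this example.

```python
from collections import Counter

# The ten most frequent genre combinations and their counts, from this report.
top_combinations = {
    "Drama": 236,
    "Comedy": 209,
    "Comedy|Drama": 191,
    "Comedy|Drama|Romance": 187,
    "Comedy|Romance": 158,
    "Drama|Romance": 152,
    "Crime|Drama|Thriller": 101,
    "Horror": 71,
    "Action|Crime|Drama|Thriller": 68,
    "Action|Crime|Thriller": 65,
}

genre_tally = Counter()
for combination, count in top_combinations.items():
    for genre in combination.split("|"):
        genre_tally[genre] += count  # each film counts once per genre it lists

for genre, count in genre_tally.most_common():
    print(genre, count)
```

Within these ten combinations only seven distinct genres appear, which hints at how a few dozen genres could expand into 914 distinct combined strings.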
Scatter

gross
Real number (ℝ≥0)

MISSING
Distinct count: 4433
Unique (%): 87.9%
Missing: 466
Missing (%): 9.2%
Infinite: 0
Infinite (%): 0.0%
Mean: 45222609.81
Minimum: 113
Maximum: 760505847
Zeros: 0
Zeros (%): 0.0%
Memory size: 39.5 KiB
Mini histogram

Quantile statistics

Minimum: 113
5-th percentile: 49525.4
Q1: 3792188
Median: 21784432
Q3: 57491000
95-th percentile: 176116785
Maximum: 760505847
Range: 760505734
Interquartile range (IQR): 53698812

Descriptive statistics

Standard deviation: 66528502.32
Coefficient of variation (CV): 1.471133634
Kurtosis: 15.85410786
Mean: 45222609.81
Median Absolute Deviation (MAD): 43527387.83
Skewness: 3.22843245
Sum: 2.069838851e+11
Variance: 4.42604162e+15
Histogram
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
5000000 4 0.1%
 
3000000 3 0.1%
 
94061311 3 0.1%
 
8000000 3 0.1%
 
7000000 3 0.1%
 
47000000 3 0.1%
 
5773519 3 0.1%
 
144512310 3 0.1%
 
34964818 3 0.1%
 
177343675 3 0.1%
 
Other values (4422) 4546 90.1%
 
(Missing) 466 9.2%
 
ValueCountFrequency (%) 
113 1 < 0.1%
 
162 1 < 0.1%
 
423 1 < 0.1%
 
576 1 < 0.1%
 
703 1 < 0.1%
 
ValueCountFrequency (%) 
760505847 1 < 0.1%
 
658672302 1 < 0.1%
 
652177271 1 < 0.1%
 
623279547 2 < 0.1%
 
533316061 1 < 0.1%
 

imdb_score
Real number (ℝ≥0)

Distinct count: 78
Unique (%): 1.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 6.442137616
Minimum: 1.6
Maximum: 9.5
Zeros: 0
Zeros (%): 0.0%
Memory size: 39.5 KiB
Mini histogram

Quantile statistics

Minimum: 1.6
5-th percentile: 4.4
Q1: 5.8
Median: 6.6
Q3: 7.2
95-th percentile: 8.09
Maximum: 9.5
Range: 7.9
Interquartile range (IQR): 1.4

Descriptive statistics

Standard deviation: 1.125115866
Coefficient of variation (CV): 0.1746494615
Kurtosis: 0.9356915064
Mean: 6.442137616
Median Absolute Deviation (MAD): 0.8730186468
Skewness: -0.7414713363
Sum: 32487.7
Variance: 1.265885711
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[1.6 2.65 3.25 4.05 4.75 ... 7.85 8.15 8.55 8.85 9.5 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
6.7 223 4.4%
 
6.6 201 4.0%
 
7.2 195 3.9%
 
6.5 186 3.7%
 
6.4 185 3.7%
 
7.3 184 3.6%
 
7 184 3.6%
 
7.1 181 3.6%
 
6.8 181 3.6%
 
6.1 179 3.5%
 
Other values (68) 3144 62.3%
 
ValueCountFrequency (%) 
1.6 1 < 0.1%
 
1.7 1 < 0.1%
 
1.9 3 0.1%
 
2 2 < 0.1%
 
2.1 3 0.1%
 
ValueCountFrequency (%) 
9.5 1 < 0.1%
 
9.3 1 < 0.1%
 
9.2 1 < 0.1%
 
9.1 3 0.1%
 
9 3 0.1%
 

language
Categorical

Distinct count: 48
Unique (%): 1.0%
Missing: 12
Missing (%): 0.2%
Memory size: 39.5 KiB
English
4704
French
 
73
Spanish
 
40
Hindi
 
28
Mandarin
 
26
Other values (42)
 
160
ValueCountFrequency (%) 
English 4704 93.3%
 
French 73 1.4%
 
Spanish 40 0.8%
 
Hindi 28 0.6%
 
Mandarin 26 0.5%
 
German 19 0.4%
 
Japanese 18 0.4%
 
Cantonese 11 0.2%
 
Italian 11 0.2%
 
Russian 11 0.2%
 
Other values (37) 90 1.8%
 
(Missing) 12 0.2%
 

Composition

Contains chars: True
Contains digits: False
Contains whitespace: False
Contains non-words: False

Length

Max length: 10
Mean length: 6.971247273
Min length: 3
Scatter

movie_fb_likes
Real number (ℝ≥0)

ZEROS
Distinct count: 876
Unique (%): 17.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 7525.964505
Minimum: 0
Maximum: 349000
Zeros: 2181
Zeros (%): 43.2%
Memory size: 39.5 KiB
Mini histogram

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 166
Q3: 3000
95-th percentile: 40000
Maximum: 349000
Range: 349000
Interquartile range (IQR): 3000

Descriptive statistics

Standard deviation: 19320.44511
Coefficient of variation (CV): 2.567171968
Kurtosis: 41.33443692
Mean: 7525.964505
Median Absolute Deviation (MAD): 11022.02801
Skewness: 5.05892689
Sum: 37953439
Variance: 373279599.2
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[0.000e+00 1.000e+00 9.250e+01 4.915e+02 9.995e+02 ... 6.550e+04 8.400e+04 1.235e+05 1.980e+05 3.490e+05], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 2181 43.2%
 
1000 109 2.2%
 
11000 83 1.6%
 
10000 81 1.6%
 
12000 62 1.2%
 
13000 58 1.2%
 
2000 56 1.1%
 
15000 53 1.1%
 
14000 50 1.0%
 
16000 47 0.9%
 
Other values (866) 2263 44.9%
 
ValueCountFrequency (%) 
0 2181 43.2%
 
2 2 < 0.1%
 
3 1 < 0.1%
 
4 5 0.1%
 
5 2 < 0.1%
 
ValueCountFrequency (%) 
349000 1 < 0.1%
 
199000 1 < 0.1%
 
197000 1 < 0.1%
 
191000 1 < 0.1%
 
190000 1 < 0.1%
 

movie_imdb_link
URL

Distinct count: 4919
Unique (%): 97.5%
Missing: 0
Missing (%): 0.0%
Memory size: 39.5 KiB
http://www.imdb.com/title/tt0077651/?ref_=fn_tt_tt_1
 
3
http://www.imdb.com/title/tt2224026/?ref_=fn_tt_tt_1
 
3
http://www.imdb.com/title/tt0360717/?ref_=fn_tt_tt_1
 
3
http://www.imdb.com/title/tt1976009/?ref_=fn_tt_tt_1
 
3
http://www.imdb.com/title/tt2638144/?ref_=fn_tt_tt_1
 
3
Other values (4914)
5028
ValueCountFrequency (%) 
http://www.imdb.com/title/tt0077651/?ref_=fn_tt_tt_1 3 0.1%
 
http://www.imdb.com/title/tt2224026/?ref_=fn_tt_tt_1 3 0.1%
 
http://www.imdb.com/title/tt0360717/?ref_=fn_tt_tt_1 3 0.1%
 
http://www.imdb.com/title/tt1976009/?ref_=fn_tt_tt_1 3 0.1%
 
http://www.imdb.com/title/tt2638144/?ref_=fn_tt_tt_1 3 0.1%
 
http://www.imdb.com/title/tt0232500/?ref_=fn_tt_tt_1 3 0.1%
 
http://www.imdb.com/title/tt3332064/?ref_=fn_tt_tt_1 3 0.1%
 
http://www.imdb.com/title/tt1267297/?ref_=fn_tt_tt_1 2 < 0.1%
 
http://www.imdb.com/title/tt0318974/?ref_=fn_tt_tt_1 2 < 0.1%
 
http://www.imdb.com/title/tt1502712/?ref_=fn_tt_tt_1 2 < 0.1%
 
Other values (4909) 5016 99.5%
 
Scheme
Value Count Frequency (%)
http 5043 100.0%

Netloc
Value Count Frequency (%)
www.imdb.com 5043 100.0%

Path
Value Count Frequency (%)
/title/tt3332064/ 3 0.1%
/title/tt0360717/ 3 0.1%
/title/tt0077651/ 3 0.1%
/title/tt2638144/ 3 0.1%
/title/tt0232500/ 3 0.1%
/title/tt1976009/ 3 0.1%
/title/tt2224026/ 3 0.1%
/title/tt1666335/ 2 < 0.1%
/title/tt0087800/ 2 < 0.1%
/title/tt0299977/ 2 < 0.1%
Other values (4909) 5016 99.5%

Query
Value Count Frequency (%)
ref_=fn_tt_tt_1 5043 100.0%

Fragment
Value Count Frequency (%)
(empty) 5043 100.0%
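
The five single-purpose frequency tables that follow the full-URL table look like the standard URL components: scheme, network location, path, query and fragment. The sketch below splits one URL from this report with Python's standard library to show which part lands in which table. How pandas-profiling performs the split internally is not shown in the report, so treat this as an equivalent illustration, not its implementation.

```python
from urllib.parse import urlsplit

# One of the movie_imdb_link values listed in this report.
url = "http://www.imdb.com/title/tt0077651/?ref_=fn_tt_tt_1"

parts = urlsplit(url)

print("scheme:  ", repr(parts.scheme))    # 'http'
print("netloc:  ", repr(parts.netloc))    # 'www.imdb.com'
print("path:    ", repr(parts.path))      # '/title/tt0077651/'
print("query:   ", repr(parts.query))     # 'ref_=fn_tt_tt_1'
print("fragment:", repr(parts.fragment))  # ''
```

Only the path varies between rows; the scheme, host and query are constant, and the fragment is empty for all 5043 URLs.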
 

movie_title
Categorical

HIGH CARDINALITY
Distinct count: 4917
Unique (%): 97.5%
Missing: 0
Missing (%): 0.0%
Memory size: 39.5 KiB
Home 
 
3
King Kong 
 
3
The Fast and the Furious 
 
3
Pan 
 
3
Halloween 
 
3
Other values (4912)
5028
ValueCountFrequency (%) 
Home  3 0.1%
 
King Kong  3 0.1%
 
The Fast and the Furious  3 0.1%
 
Pan  3 0.1%
 
Halloween  3 0.1%
 
Ben-Hur  3 0.1%
 
Victor Frankenstein  3 0.1%
 
The Legend of Tarzan  2 < 0.1%
 
Snitch  2 < 0.1%
 
Aloha  2 < 0.1%
 
Other values (4907) 5016 99.5%
 

Composition

Contains chars: True
Contains digits: True
Contains whitespace: True
Contains non-words: True

Length

Max length: 87
Mean length: 16.54967281
Min length: 2
Scatter

num_critic_for_reviews
Real number (ℝ≥0)

Distinct count: 529
Unique (%): 10.5%
Missing: 50
Missing (%): 1.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 140.194272
Minimum: 1
Maximum: 813
Zeros: 0
Zeros (%): 0.0%
Memory size: 39.5 KiB
Mini histogram

Quantile statistics

Minimum: 1
5-th percentile: 9
Q1: 50
Median: 110
Q3: 195
95-th percentile: 387
Maximum: 813
Range: 812
Interquartile range (IQR): 145

Descriptive statistics

Standard deviation: 121.6016754
Coefficient of variation (CV): 0.8673797701
Kurtosis: 2.91341641
Mean: 140.194272
Median Absolute Deviation (MAD): 92.35207408
Skewness: 1.5165327
Sum: 699990
Variance: 14786.96746
Histogram
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1 43 0.9%
 
9 37 0.7%
 
5 36 0.7%
 
10 35 0.7%
 
8 35 0.7%
 
12 34 0.7%
 
81 33 0.7%
 
16 33 0.7%
 
43 31 0.6%
 
29 30 0.6%
 
Other values (518) 4646 92.1%
 
(Missing) 50 1.0%
 
ValueCountFrequency (%) 
1 43 0.9%
 
2 26 0.5%
 
3 24 0.5%
 
4 29 0.6%
 
5 36 0.7%
 
ValueCountFrequency (%) 
813 1 < 0.1%
 
775 1 < 0.1%
 
765 1 < 0.1%
 
750 2 < 0.1%
 
739 1 < 0.1%
 

num_user_for_reviews
Real number (ℝ≥0)

Distinct count: 955
Unique (%): 18.9%
Missing: 21
Missing (%): 0.4%
Infinite: 0
Infinite (%): 0.0%
Mean: 272.7708084
Minimum: 1
Maximum: 5060
Zeros: 0
Zeros (%): 0.0%
Memory size: 39.5 KiB
Mini histogram

Quantile statistics

Minimum: 1
5-th percentile: 10
Q1: 65
Median: 156
Q3: 326
95-th percentile: 907.8
Maximum: 5060
Range: 5059
Interquartile range (IQR): 261

Descriptive statistics

Standard deviation: 377.9828856
Coefficient of variation (CV): 1.385716044
Kurtosis: 26.43829739
Mean: 272.7708084
Median Absolute Deviation (MAD): 228.8571855
Skewness: 4.121475159
Sum: 1369855
Variance: 142871.0618
Histogram
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1 51 1.0%
 
3 33 0.7%
 
26 32 0.6%
 
2 32 0.6%
 
10 29 0.6%
 
6 28 0.6%
 
50 26 0.5%
 
32 25 0.5%
 
8 25 0.5%
 
31 24 0.5%
 
Other values (944) 4717 93.5%
 
ValueCountFrequency (%) 
1 51 1.0%
 
2 32 0.6%
 
3 33 0.7%
 
4 23 0.5%
 
5 19 0.4%
 
ValueCountFrequency (%) 
5060 1 < 0.1%
 
4667 1 < 0.1%
 
4144 1 < 0.1%
 
3646 1 < 0.1%
 
3597 1 < 0.1%
 

num_voted_users
Real number (ℝ≥0)

Distinct count: 4826
Unique (%): 95.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 83668.16082
Minimum: 5
Maximum: 1689764
Zeros: 0
Zeros (%): 0.0%
Memory size: 39.5 KiB
Mini histogram

Quantile statistics

Minimum: 5
5-th percentile: 514.6
Q1: 8593.5
Median: 34359
Q3: 96309
95-th percentile: 332254.9
Maximum: 1689764
Range: 1689759
Interquartile range (IQR): 87715.5

Descriptive statistics

Standard deviation: 138485.2568
Coefficient of variation (CV): 1.655172714
Kurtosis: 24.44552017
Mean: 83668.16082
Median Absolute Deviation (MAD): 84252.04372
Skewness: 4.029871144
Sum: 421938535
Variance: 1.917816635e+10
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[5.000000e+00 1.240000e+02 6.095000e+02 1.669500e+03 6.382500e+03 ... 2.222510e+05 3.340960e+05 5.374305e+05 8.868355e+05 1.689764e+06], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
57 5 0.1%
 
6 4 0.1%
 
6025 3 0.1%
 
374 3 0.1%
 
53 3 0.1%
 
3119 3 0.1%
 
62 3 0.1%
 
162 3 0.1%
 
2541 3 0.1%
 
8 3 0.1%
 
Other values (4816) 5010 99.3%
 
ValueCountFrequency (%) 
5 2 < 0.1%
 
6 4 0.1%
 
7 2 < 0.1%
 
8 3 0.1%
 
10 1 < 0.1%
 
ValueCountFrequency (%) 
1689764 1 < 0.1%
 
1676169 1 < 0.1%
 
1468200 1 < 0.1%
 
1347461 1 < 0.1%
 
1324680 1 < 0.1%
 

plot_keywords
Categorical

MISSING
HIGH CARDINALITY
Distinct count: 4761
Unique (%): 94.4%
Missing: 153
Missing (%): 3.0%
Memory size: 39.5 KiB
based on novel
 
4
alien friendship|alien invasion|australia|flying car|mother daughter relationship
 
3
eighteen wheeler|illegal street racing|truck|trucker|undercover cop
 
3
animal name in title|ape abducts a woman|gorilla|island|king kong
 
3
one word title
 
3
Other values (4755)
4874
ValueCountFrequency (%) 
based on novel 4 0.1%
 
alien friendship|alien invasion|australia|flying car|mother daughter relationship 3 0.1%
 
eighteen wheeler|illegal street racing|truck|trucker|undercover cop 3 0.1%
 
animal name in title|ape abducts a woman|gorilla|island|king kong 3 0.1%
 
one word title 3 0.1%
 
halloween|masked killer|michael myers|slasher|trick or treat 3 0.1%
 
1940s|child hero|fantasy world|orphan|reference to peter pan 3 0.1%
 
assistant|experiment|frankenstein|medical student|scientist 3 0.1%
 
sandman|spider man|symbiote|venom|villain 2 < 0.1%
 
audition|friendship|graduation|high school graduation|love 2 < 0.1%
 
Other values (4750) 4861 96.4%
 
(Missing) 153 3.0%
 

Composition

Contains chars: True
Contains digits: True
Contains whitespace: True
Contains non-words: True

Length

Max length: 149
Mean length: 50.92742415
Min length: 2
Scatter

title_year
Real number (ℝ≥0)

MISSING
Distinct count: 92
Unique (%): 1.8%
Missing: 108
Missing (%): 2.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 2002.470517
Minimum: 1916
Maximum: 2016
Zeros: 0
Zeros (%): 0.0%
Memory size: 39.5 KiB
Mini histogram

Quantile statistics

Minimum: 1916
5-th percentile: 1979
Q1: 1999
Median: 2005
Q3: 2011
95-th percentile: 2015
Maximum: 2016
Range: 100
Interquartile range (IQR): 12

Descriptive statistics

Standard deviation: 12.47459892
Coefficient of variation (CV): 0.006229604289
Kurtosis: 7.439212616
Mean: 2002.470517
Median Absolute Deviation (MAD): 8.554733481
Skewness: -2.29227335
Sum: 9882192
Variance: 155.6156182
Histogram
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
2009 260 5.2%
 
2014 252 5.0%
 
2006 239 4.7%
 
2013 237 4.7%
 
2010 230 4.6%
 
2015 226 4.5%
 
2008 225 4.5%
 
2011 225 4.5%
 
2005 221 4.4%
 
2012 221 4.4%
 
Other values (81) 2599 51.5%
 
ValueCountFrequency (%) 
1916 1 < 0.1%
 
1920 1 < 0.1%
 
1925 1 < 0.1%
 
1927 1 < 0.1%
 
1929 2 < 0.1%
 
ValueCountFrequency (%) 
2016 106 2.1%
 
2015 226 4.5%
 
2014 252 5.0%
 
2013 237 4.7%
 
2012 221 4.4%
 

Correlations

Missing values

Sample

First rows

,actor_1_fb_likes,actor_1_name,actor_2_fb_likes,actor_2_name,actor_3_fb_likes,actor_3_name,aspect_ratio,budget,cast_total_fb_likes,color,content_rating,country,director_fb_likes,director_name,duration,facenumber_in_poster,genres,gross,imdb_score,language,movie_fb_likes,movie_imdb_link,movie_title,num_critic_for_reviews,num_user_for_reviews,num_voted_users,plot_keywords,title_year
0,1000.0,CCH Pounder,936.0,Joel David Moore,855.0,Wes Studi,1.78,237000000.0,4834,Color,PG-13,USA,0.0,James Cameron,178.0,0.0,Action|Adventure|Fantasy|Sci-Fi,760505847.0,7.9,English,33000,http://www.imdb.com/title/tt0499549/?ref_=fn_tt_tt_1,Avatar,723.0,3054.0,886204,avatar|future|marine|native|paraplegic,2009.0
1,40000.0,Johnny Depp,5000.0,Orlando Bloom,1000.0,Jack Davenport,2.35,300000000.0,48350,Color,PG-13,USA,563.0,Gore Verbinski,169.0,0.0,Action|Adventure|Fantasy,309404152.0,7.1,English,0,http://www.imdb.com/title/tt0449088/?ref_=fn_tt_tt_1,Pirates of the Caribbean: At World's End,302.0,1238.0,471220,goddess|marriage ceremony|marriage proposal|pirate|singapore,2007.0
2,11000.0,Christoph Waltz,393.0,Rory Kinnear,161.0,Stephanie Sigman,2.35,245000000.0,11700,Color,PG-13,UK,0.0,Sam Mendes,148.0,1.0,Action|Adventure|Thriller,200074175.0,6.8,English,85000,http://www.imdb.com/title/tt2379713/?ref_=fn_tt_tt_1,Spectre,602.0,994.0,275868,bomb|espionage|sequel|spy|terrorist,2015.0
3,27000.0,Tom Hardy,23000.0,Christian Bale,23000.0,Joseph Gordon-Levitt,2.35,250000000.0,106759,Color,PG-13,USA,22000.0,Christopher Nolan,164.0,0.0,Action|Thriller,448130642.0,8.5,English,164000,http://www.imdb.com/title/tt1345836/?ref_=fn_tt_tt_1,The Dark Knight Rises,813.0,2701.0,1144337,deception|imprisonment|lawlessness|police officer|terrorist plot,2012.0
4,131.0,Doug Walker,12.0,Rob Walker,NaN,NaN,NaN,NaN,143,NaN,NaN,NaN,131.0,Doug Walker,NaN,0.0,Documentary,NaN,7.1,NaN,0,http://www.imdb.com/title/tt5289954/?ref_=fn_tt_tt_1,Star Wars: Episode VII - The Force Awakens,NaN,NaN,8,NaN,NaN
5,640.0,Daryl Sabara,632.0,Samantha Morton,530.0,Polly Walker,2.35,263700000.0,1873,Color,PG-13,USA,475.0,Andrew Stanton,132.0,1.0,Action|Adventure|Sci-Fi,73058679.0,6.6,English,24000,http://www.imdb.com/title/tt0401729/?ref_=fn_tt_tt_1,John Carter,462.0,738.0,212204,alien|american civil war|male nipple|mars|princess,2012.0
6,24000.0,J.K. Simmons,11000.0,James Franco,4000.0,Kirsten Dunst,2.35,258000000.0,46055,Color,PG-13,USA,0.0,Sam Raimi,156.0,0.0,Action|Adventure|Romance,336530303.0,6.2,English,0,http://www.imdb.com/title/tt0413300/?ref_=fn_tt_tt_1,Spider-Man 3,392.0,1902.0,383056,sandman|spider man|symbiote|venom|villain,2007.0
7,799.0,Brad Garrett,553.0,Donna Murphy,284.0,M.C. Gainey,1.85,260000000.0,2036,Color,PG,USA,15.0,Nathan Greno,100.0,1.0,Adventure|Animation|Comedy|Family|Fantasy|Musical|Romance,200807262.0,7.8,English,29000,http://www.imdb.com/title/tt0398286/?ref_=fn_tt_tt_1,Tangled,324.0,387.0,294810,17th century|based on fairy tale|disney|flower|tower,2010.0
8,26000.0,Chris Hemsworth,21000.0,Robert Downey Jr.,19000.0,Scarlett Johansson,2.35,250000000.0,92000,Color,PG-13,USA,0.0,Joss Whedon,141.0,4.0,Action|Adventure|Sci-Fi,458991599.0,7.5,English,118000,http://www.imdb.com/title/tt2395427/?ref_=fn_tt_tt_1,Avengers: Age of Ultron,635.0,1117.0,462669,artificial intelligence|based on comic book|captain america|marvel cinematic universe|superhero,2015.0
9,25000.0,Alan Rickman,11000.0,Daniel Radcliffe,10000.0,Rupert Grint,2.35,250000000.0,58753,Color,PG,UK,282.0,David Yates,153.0,3.0,Adventure|Family|Fantasy|Mystery,301956980.0,7.5,English,10000,http://www.imdb.com/title/tt0417741/?ref_=fn_tt_tt_1,Harry Potter and the Half-Blood Prince,375.0,973.0,321795,blood|book|love|potion|professor,2009.0

Last rows

,actor_1_fb_likes,actor_1_name,actor_2_fb_likes,actor_2_name,actor_3_fb_likes,actor_3_name,aspect_ratio,budget,cast_total_fb_likes,color,content_rating,country,director_fb_likes,director_name,duration,facenumber_in_poster,genres,gross,imdb_score,language,movie_fb_likes,movie_imdb_link,movie_title,num_critic_for_reviews,num_user_for_reviews,num_voted_users,plot_keywords,title_year
5033,291.0,Shane Carruth,45.0,David Sullivan,8.0,Casey Gooden,1.85,7000.0,368,Color,PG-13,USA,291.0,Shane Carruth,77.0,0.0,Drama|Sci-Fi|Thriller,424760.0,7.0,English,19000,http://www.imdb.com/title/tt0390384/?ref_=fn_tt_tt_1,Primer,143.0,371.0,72639,changing the future|independent film|invention|nonlinear timeline|time travel,2004.0
5034,0.0,Ian Gamazon,0.0,Edgar Tancangco,0.0,Quynn Ton,NaN,7000.0,0,Color,Not Rated,Philippines,0.0,Neill Dela Llana,80.0,0.0,Thriller,70071.0,6.3,English,74,http://www.imdb.com/title/tt0428303/?ref_=fn_tt_tt_1,Cavite,35.0,35.0,589,jihad|mindanao|philippines|security guard|squatter,2005.0
5035,121.0,Carlos Gallardo,20.0,Peter Marquardt,6.0,Consuelo Gómez,1.37,7000.0,147,Color,R,USA,0.0,Robert Rodriguez,81.0,0.0,Action|Crime|Drama|Romance|Thriller,2040920.0,6.9,Spanish,0,http://www.imdb.com/title/tt0104815/?ref_=fn_tt_tt_1,El Mariachi,56.0,130.0,52055,assassin|death|guitar|gun|mariachi,1992.0
5036,45.0,Richard Jewell,44.0,John Considine,2.0,Sara Stepnicka,NaN,3250.0,93,Color,PG-13,USA,2.0,Anthony Vallone,84.0,0.0,Crime|Drama,NaN,7.8,English,4,http://www.imdb.com/title/tt0430371/?ref_=fn_tt_tt_1,The Mongol King,NaN,1.0,36,jewell|mongol|nostradamus|stepnicka|vallone,2005.0
5037,296.0,Kerry Bishé,205.0,Caitlin FitzGerald,133.0,Daniella Pineda,NaN,9000.0,690,Color,Not Rated,USA,0.0,Edward Burns,95.0,1.0,Comedy|Drama,4584.0,6.4,English,413,http://www.imdb.com/title/tt1880418/?ref_=fn_tt_tt_1,Newlyweds,14.0,14.0,1338,written and directed by cast member,2011.0
5038,637.0,Eric Mabius,470.0,Daphne Zuniga,318.0,Crystal Lowe,NaN,NaN,2283,Color,NaN,Canada,2.0,Scott Smith,87.0,2.0,Comedy|Drama,NaN,7.7,English,84,http://www.imdb.com/title/tt3000844/?ref_=fn_tt_tt_1,Signed Sealed Delivered,1.0,6.0,629,fraud|postal worker|prison|theft|trial,2013.0
5039,841.0,Natalie Zea,593.0,Valorie Curry,319.0,Sam Underwood,16.00,NaN,1753,Color,TV-14,USA,NaN,NaN,43.0,1.0,Crime|Drama|Mystery|Thriller,NaN,7.5,English,32000,http://www.imdb.com/title/tt2071645/?ref_=fn_tt_tt_1,The Following,43.0,359.0,73839,cult|fbi|hideout|prison escape|serial killer,NaN
5040,0.0,Eva Boehnke,0.0,Maxwell Moody,0.0,David Chandler,NaN,1400.0,0,Color,NaN,USA,0.0,Benjamin Roberds,76.0,0.0,Drama|Horror|Thriller,NaN,6.3,English,16,http://www.imdb.com/title/tt2107644/?ref_=fn_tt_tt_1,A Plague So Pleasant,13.0,3.0,38,NaN,2013.0
5041,946.0,Alan Ruck,719.0,Daniel Henney,489.0,Eliza Coupe,2.35,NaN,2386,Color,PG-13,USA,0.0,Daniel Hsia,100.0,5.0,Comedy|Drama|Romance,10443.0,6.3,English,660,http://www.imdb.com/title/tt2070597/?ref_=fn_tt_tt_1,Shanghai Calling,14.0,9.0,1255,NaN,2012.0
5042,86.0,John August,23.0,Brian Herzlinger,16.0,Jon Gunn,1.85,1100.0,163,Color,PG,USA,16.0,Jon Gunn,90.0,0.0,Documentary,85222.0,6.6,English,456,http://www.imdb.com/title/tt0378407/?ref_=fn_tt_tt_1,My Date with Drew,43.0,84.0,4285,actress name in title|crush|date|four word title|video camera,2004.0